Metrics for comparing regulatory sequences on the basis of pattern counts
نویسنده
چکیده
MOTIVATION Upstream sequences contain short motifs, which mediate transcriptional regulation by specifically binding different transcription factors. The presence of common motifs in the regulatory regions of two genes might be considered as a clue for a potential co-regulation. A pattern count-based (dis)similarity metric between sequences could thus be used to classify genes according to their putative regulatory properties. RESULTS We present here several metrics which rely on probability theory, and which aim at comparing sequences on the basis of pattern counts. We compare these metrics to several classical dissimilarity and similarity metrics, and illustrate their behaviour with a biological example.
منابع مشابه
Combining pattern discovery and discriminant analysis to predict gene co-regulation
MOTIVATION Several pattern discovery methods have been proposed to detect over-represented motifs in upstream sequences of co-regulated genes, and are for example used to predict cis-acting elements from clusters of co-expressed genes. The clusters to be analyzed are often noisy, containing a mixture of co-regulated and non-co-regulated genes. We propose a method to discriminate co-regulated fr...
متن کاملDetection and Classification of Breast Cancer in Mammography Images Using Pattern Recognition Methods
Introduction: In this paper, a method is presented to classify the breast cancer masses according to new geometric features. Methods: After obtaining digital breast mammogram images from the digital database for screening mammography (DDSM), image preprocessing was performed. Then, by using image processing methods, an algorithm was developed for automatic extracting of masses from other norma...
متن کاملDetection and Classification of Breast Cancer in Mammography Images Using Pattern Recognition Methods
Introduction: In this paper, a method is presented to classify the breast cancer masses according to new geometric features. Methods: After obtaining digital breast mammogram images from the digital database for screening mammography (DDSM), image preprocessing was performed. Then, by using image processing methods, an algorithm was developed for automatic extracting of masses from other norma...
متن کاملHigh Fuzzy Utility Based Frequent Patterns Mining Approach for Mobile Web Services Sequences
Nowadays high fuzzy utility based pattern mining is an emerging topic in data mining. It refers to discover all patterns having a high utility meeting a user-specified minimum high utility threshold. It comprises extracting patterns which are highly accessed in mobile web service sequences. Different from the traditional fuzzy approach, high fuzzy utility mining considers not only counts of mob...
متن کاملA comparative study of the spatial pattern in rural settlements; The case study of Shalheh-ye Emam Hasan of Abadan and Hadbeh khanafereh of Shadegan
The village is a long-standing form of human settlement that embodies the vernacular pattern of human habitation and innovative ways of adapting to the environment. Village dwellers are the creators of unique cultural landscapes which are looked upon as the heritage for future generations. Therefore recognizing the dominant patterns in rural landscapes has always been of interest to researchers...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Bioinformatics
دوره 20 3 شماره
صفحات -
تاریخ انتشار 2004